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3£»e>nfciE;&jeiBHrt©tt*'<* wns*f©* 1 fe y 

FW-ft57f^t, (b) «3i»ft^b;i' 

v 3 4iotfHitfefci:, ®&mmffiimvv y y±x- 
v2*i"C»i: , r* : ?«>^»e>n&iE*j^6Hrt©»*^ 

(c) SUBv^o^Dv^fflliFlBWW:**^ 

;H£ffi©H§3-fey h**— f^S^y^S^A/t-fifc 

[If5RiS2] WbV 2 £ji.ottajffc© 
5£»£nfcIE;£?fc$Gffll*)©S§lfey h©»§^ WHS 

SJfrottHrt-&M>C, 1 /2WmtMB£#V * K±T- V 
l*if«C»i:-r*iE*JJg«Hl rt©8ffl©»*^ bA-fc 
1ffi©Sg 1 fe y h * -tf- — ^-f 3 * 5 1 s» ^£ $ 5> C ^tr Z t 

into®??* h-y--^-3»stt©»4»*iiHes*»"3fc"y- 

fc> SaiSXflMfclS^U y 1*±-CV 2 * 
*»e.*ifciESJBI6fflrt©JB2-fev b©»j£^ WHS 

fe»o»tu-rfe«>Cs i / 6 mmm\m.if u y k ±-e v 

2*^^i:-rSiE^95Hl |*l©8fl©IJj£^ WM!£ 
1i©gg2fey bfctf— ^NB^viTfcS&fcd'tJifc 

<5»#k i t«a«©tt*^^ b;i>*ifjrra 

[11*114] V2*H8'JM'-hS»3^bft*1"*» 
ffltS^TfT'fc, ttttgu©^?:? (c) £**y 

s * y y h s £ tr t £ i^s £ -r a at *a 

l tlBii©»jS^^ b;i/£Jtjrf ££©©:77* Mr- 

T V 3 £ "f /&> t: f 3 ^ ftfejE£fl3ftHrt©*6 3 

■fey b©9Jjt^ b;u^<&1t-f-rSmjBB^xy7' 



(fibk-j-SiE^rJBIEHl rt©8ffl©tt#^ b;HR»©" 

©77^ h-y-f-awfint©***!*****^^"^— 

t* v 3 & 4 1 <c> t r s =f «>* © ft fc ji *jg<6 h com 3 
■fey bCfi&iS^ h^HRHftV— f-t zmzzTvy 
it, mil3^3fey hotoSt* h;iHRtt>t>©JRK:^ b 
»#©«W*^* y 7t57r y 7* £ 6 t^tri t £ 

WfRfc-rsiMS! i tiiaR©**^^ b;i/^ffi^-rsfc 

[IS*il7] mmm l fey b©«jt^-^ 
-^-TSSuiBT-^y^ttv ^KDliP5*^afe©t:^i 
©7-f;i/*SttfflLTflSIB!Bi-fef b©»)&-<^ h;HI^ 
il*1f-ft s» * 6 W*, mii3^2fey h 
©»*-<* b;i/^«^-y--^-TSmiiB77 i y^lis ^2 
©ffiP^^ff a fc«>t:» 2 w ^ffiffl bTflufB^ 2 

•fey b©«j^^^ h;v«liS:-y-— ft5^f97"^?. 
{3^, «ufBSI3fey hot&t^^ ^"T 
SffiiB^^ y 7"t±s m 3 (Dffi?$* ft d fcJ&CJB 3 ©7 
;^S^fflbTmllB^3fey b©»^^^ b;i4^«^1f 
-f-t Z> ^ r- y rf* 2 & ic^tr t h f S if*JH 
l I3|3«6©ffij^^^ h;uS:}t^-rsfc©©7 77 MJ-- 

^*e»nfcjE^lgllirt©^2fey b©!!;^^^ 
m*V-*--r*ffl1BX7-y-7tiis (a) V2A s "4"Ca3 

ssa^ i/3'vvM*©3«*v 2 fc«fceffi©R 

D n 7 b t) o 1 / 2 "VHfie&lf— ^"T -S ^ r- y 7" 
hs (b) V2#3-t-«Wf b;i/f*8tBli, 
v 2 £«t>o£^ i/3-<.;u)l«jg©4^ 

ftS^ryT'^ (c) V 2 *52o©=}-^— 
h;HBfc*a»&iiN 2o©rj-^--©4'©J: Dffiv-'R 

*»ottlB3— ^— ^©^©7^ >CS4.3£VM/3^ 

C^trd h^m^-T 1 CfBtt©**-^^ 
i&Ji«-ri>fci*©7 77 Mt-^»Btt©Jfc6tt*»tfE 

[lt*H9] lo©v^Dypy^fc;tt-r?)«jg&i!j 

-fbt3*5t^Sffib^-«> h;u<&Jll^1-2)fc©©s aj£tt©fe 
Stt*llflEft*>ofe1f-?-*JS-C»'5T, (a) SSI 
«M*1f"5fc»©»l7'f;^*fl|V'iTs 
■ f;i/V- 2 ^ji,o{j-tBl-fc©t, Vift* ^fc-ra^u y K 
l*i©^lfey Y(om^^ \>M8&*V-3-t2>*7-y 



(3) 



4$M¥13-189934 



7ts (b) i2»afefcfflf2 7^^4ffl 
ut\ AMbt^^b^VaftJiotttHf fcftCs V 2 fe 

ft-ft-f^S^y?*^ (c) m3ffi?3*ft5tz&> 

7 Wi/fem5rr&fcto©®J&14©&3ib^JSfefe^& 
OUS** 7 -f ©GSffl ft * & t d k ft W« i: -r s 

»«3R9k:ib«©»*^ wi/feaarrsfctooaufctt 

[flt&H Hi * 1 »M*fT-5 *:«>©* 1 ? J** ft 

©^#^c7-f ;i/^©ftfflft*e>t^N ^SISftfT? 
fc#>©H 2 7 4)Uf> ftfflVN-C It— 5f!lfB^r- s> 7* 

<oi*v^iB*ff5fe«)®a^7-f ;i/*©<£fflfe££> 
t^ir;rfcfe^t-r2>li:*JM9£i3®©ib§^ bA- 
ft JtJfef 3 fc *&© Wfctt© ;& £ »> # *§« fe fe 3 felt 

[ft$3I12] *3MIB*fT'3feM>©83:M;i'*fe 

fliivt-y-— ^-rstuiBx^ v so^v^wft^f ^ fc 

7 ©<£ffl fe 2 £ £^tr £ i: fe^ i: "T 6 
W^9f 1 1 t3|B«i©ib£^ b;v-*J65fersfc«>©JtoB 
tt©& £■& t Jtfife 4) -3 tzV-— ?J3&o 

m&m 13] us 1 ma*ff5 tzubom 1 7 ^ ft 

fflV^T-y--5=-r-5BUiB^5 i ^7'{^ 2X2©#I71/- 

a few wr * fc «>©}rwb 7 ©ffiffi ft ^ e. c^tr 

•r 3 »©isi&tt©» a» s»*fe t> ^> 
NM&H 14] mi ffiF^ fefr-5 fc«>©!g 1 7 ft 

a fettiwr a fe£35£«S7f2;7 -f W ©<£E ft * c $ 

g^M.flft^fcftCS^^^ft^T-tf-'?- 

-rsiUB^^^^t^ Mft^ffi®*fttttb<Dtt#.y 
■i )\># <D&m fe £ fc^tr - fe^ss -r S W^iS 9 1 
IB®©!*^^ h;ufef6^-T5fc«)©®(51t©$>5S&^ 

j#afe*ofc-9--^So 

[fltt&R 1 5 ] $ 3«Mfe*r-5 fci*©!fS 3 7 fe 

&©£;£{* 7 ©($ffl fe £ h £ ^tr £ £ fe^® £ f 
4»*5il 4fcI3«©»S"** Hi/feat** a fc»©fc 

^bt^w-aibt^ wi/fejfurrs;feto©s austt© 



l©ttg*l*fe1J---f-1" a*^^^ (b) Hul3£§ 
1 ©ftitSI)£-<7 h;i/fc«i«B» 1 ©»*»*fe^<b^ 
%7,=rv?t (c) /J?a<fc*»lo©!|2©«i*«S 
T-huIB-x'^ d^D'^ ©'J*& < i: 4> 1 o©^ 2 ©ftMffii 
^^hM1t-? L n^fy7'^ (d) HuiBiJ>*e 
< hfe io©^2©ft®Kit-<7 b;Pi:, iFi3'>«c<i: 

1 o©^ 2 (Dmmmh^n^it-ti,^ v ?t s 

(e) miiB^li:'J>^<t i fel^©ft®»»&^^b>'i' 

fefflv^-cs^K•r?)7^^7•tfe^^T-m5 3i:fe ! tts^fc 
Kfefe-^fe-y-— ^*^o 

[llsRII 17] V- ^M^*lJ«S*fefflV^Tfl^iIBfl:ji 
^lbt^7 b^aiR-r-5mllB^7 L y7'{^ Bul3ft®ffi)^ 
h;i/i:i)l3Sjiil)^llSS©ffi^fe^'^2>fe* t;: S 

•y-S7x^7fe^e)t:$trc:i:fe#®i:-r^iS**i 6 

cE«oiit'«^ h;vfeif^-rsfc«>©3gjiBffi©$>si> 

18] < h i> l o©^ 2 ©ib£#&KT*'J> 
i:fe i-z>com2<DmM&^^ h;i/fett— ^-r&fti 
S37^ >^ 7*J±> mSBS6 1 ©!&&«i§E<k ilflM^* < i: . 
*)10©*2©»*»ftTJN 'J>*<fcfel^©^2©g 

Mffij^^^ h;ufelt— ^"f^^^^yfe^^ti^trcii: 
fett»i:"r*HI#9l 1 6 fcilB®©!!)^^^ v;i/fe«i^-r 
£> fcK>©iffl5Btt©fe SKl^JtfiEfe fe-p felt— 
[8I#3! 19] V- h-S^^S^fefflV^T^ilBft® 

b;i/fe®*K-rsfc«)^ "s^+Lxty y m<o 

U- h^$iJ^Sipfe{Sffl-rS7^^7fe$e»t$tr^ 

fcfe^ii-rsff^ia i 6i3i3«©i!?^^7 b^feji^ 

f i> tz tb<D : mM.<D-3b % ft t fe -3 felt— 

tb&^<7 ^;^feB■^^tttit■c:i:^3«t»^^ ttj§fflfflWjffl^F 
^•fbtasttsttS"** h ;u fe ji^I" s <*© jS*Stt© 
^B^l§lSSfefe-3fe•9•-^•^^St'fe^"C^ < a > WB 
?i? d >y ^© l ocilt lo 
©li)&liafe^-^1"S^v L ^7J:^ (b) H»-f3 

-3T#W!3nfcRj**flF^ (VLC) 7 1 " — y")^if £><do 

(c) ^^©JSfiSKt^l^TBillBSS^illlt^ h 
;i/fe^ib-r 5Xf >f 7*i: fe^T-figS - tztmt? 

zmz^z h;i/fe«utr5&«)©ai6tt©**»*»* 
fefe^fctt— ?-:m> 

[tt*^ 2 1] io©7^D/oy^t:^t5io© 

ttii^ftc^tjaKt^* b;ufe^-rsfc«)©->7 

x^T'fe-3T> (a) ataa*** b;i/V 2 &J&olj- 
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*? WHS***— lTObSlis (b) 

mvuSLW s» K±TV2ftt«i&i:"ra : ?i&*«>&tifciE 

4fctf>©S5 2i3^ftl8&^ (c) AutB^P^P^ 

IE£J|gttHrt®&3-te9 r©«lS^* b-MSffift-tf— 
f 5 fc 3 *Wb«i: ft - ft#£& f 

CMrm 2 2 ] HuiBSI 1 ffWiS, « 2 ff*4b8« «t 

*nt#K2 ltnaR©*^* b^ftjt^-r a &#>©-> 

[$BBJ3©!$NB&i&i!§] 
[0 0 0 1] 

[0 0 0 2] 

[t£*©8fl5] fiE*©»£*i£#i*fc:feV^ W^b-T 

16x1 6tf^-fe;i/©i^i— y--f x©iB0^D>y^t^i!i 
( ) rt:/o us© 

glV^ P >j/^ l^-A|*)©ft*> — gCTS 

D/DVi' ^WlMtts V £ D 7P y >7 © 2 

b;u ( "Sb£^ b;^' ) cfc-sTSEBisJia. ft 
b;i/©&j*#ttMJMHiT?ai£3ft3. 
[0 0 0 3] 0i.l3.fcf, I-?^D/D«^l:gti- srr& 

^b;H± (0, 0) T-&£ 0 «*>-»■*- S#JRv*o 

(2, 3) T'feSo frj^ttflb&t* r/Wk 7K¥e£# 
X#VSifi$#Y#gf&IIi&teft#bT^S©T\ HR 

mm (xo: ) ©Jtsft* 

-r-shsdo hic^Tx ^b;wvi= (i, 1) 

£ b;i/ftSU:>bTU3o 

[0004] »mis/->«t>©»»ar^x^ bi±> b*> 

fcM£f!>b&V^ K©ffli)^tts XMY^lfilCS^fe^ 



/NiIj?il©ii#ffiftm-5;:i:7} s T-£, 1/NB* 
i/N^;i/) ©*»Kft*1-ai:3ER3ft4. 

[0 0 0 5] I TU«ft»flMMMb«H© "H. 2 6 L 
afSKJ*1 _ 5SI36l*V*>»t^©JtS» (Response to Call 

for Proposals for H.26L) " (Q. 1 5/SG1 6, 
X9Q 15-F-l 1, V>;W 1 9 9 8^1 

uit umfmiwmmtmKD k h. 2 6 L«j&c«-r 

STelenor ©St^CSS'fb (Enhancement of theTe 
lenor Proposal for H.26L) " (Q. 1 5/SG1 6, 
*»Q1 5-G-2 5, t>bU-, 1 999#2fi) 
t&UT, Gisle Bj oitegaardlt i 
BWrafttfttl&H. 26L ("Teleno rfffft 
S" ) CHbi/a^i'W*©**'** b;i/i:ms<*tt 

Mi«»©«ffl*a3i!bfc. .rnftUffi-rafctos;:, Te 

1 e n o rWHm&s A*#tt»HB7 -f ftHWC 
#IS7V-A43X 3»|«-*-*frXtt 

ffiOjt^UftifeSfcfS. ll©4A6nfcv*B?D 
y * Teleno r«Wb«tUBi»*^ 

b;i/ft2-o©^^^XT»^-rs. !P*b, 19«NbBfit. 

jn&i/siintflME©'** vjuvi/ai&mt*» h 

1 ©0JTWU 3 x 3I»I7 V— Art© (16X16 
Eiift) CD^8 7"D-y^S:I^ ^©i^^Hb^^ b 
^Vi/ 3 = (VX, VY) = (1+1/3, 1) CHS 
XP v Sftfe— SSfT £ XD V * ftM-oltffi 
-To iCTelenorffWbStt, *o*»©IHI)Bft* 
bTl^o 5fc-f, l/S^Hi*©©^^^ b;i/fttt» 
f S 7?7, MJ-— ^iWfc»H«tilD5F 

»7^;i/^l: (±S»C*5l">T) fiBffl-TSCltT-feSo 

*§*^bT, fr»bfeSi&*^ b;Hi> s®-ett«c<^ 
jfc*3feB3fite«a:fW«ftSb*«CWflliK:o<. 
13, i(DTelenor^« l/3IB^t@^ 
bfc*»b-hS»«£**CJ:*|||tftfflV\ ^©fc 

V->o PFglCs d©Teleno r^^bSCUSS^F^ 
( "VLC" ) l/31B^C@^bfcm* 
^£*bTV^S©T^ »tt*»!tt*rbT»6T*-ra«: 

[0006] g*fi<ffle>nT^sWrtiiijE®s^T{±s 
l/zwmmtT**^ b;u*jt«bTw^br4 

^©llStt, t»£Hu©W?5tJ:nii, J: DlSi^XttJ: 
t) Mtettft * 1 4 9b *»JST BitSiWl: & 2> « » t 

iajncEKtww^fen&v'ifcftviftbTv^fc*'^^ 
b*b^* s fes c:n^©^fti©w^{i> ftMbv 

-bS»fJJ^S^ftfflV^Tffi&t^^ rJl/fcjtJfcb-Cfce. 
ltW©^<b ft »i> ^ •& ?> fc to t if J: -5 )&a^e©i£b 
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[0 0 0 7] ^i.oim.m(Dm%(0 1 ~z>£ Its Ber 
nd -Gi r odOiX (WtGi r o d©fi:x;i:li)ei3 
"#2&^;vMJKCJ;3i!j§^r?iflJ (Motion-Corn 
pensating Prediction with Fractional-Pel Accurac 
y) " (IEEE Sfl^, %A 1^4-9, 6 0 4- 

6 1 21, 1 9 9 3*P4E) ifiM&W btlZx, ^OGir 

fe^,, Girodit &frmf&£.mz#vz>wtmm%'< 
tztbtzmt&^mtt&fr ( "mad" ) ©«¥&fflVN 

fee J»b£nfe{5*fc:fi<fefc£# 
ofc©tftiJPfi;**lS. ft^tCs GirodS, ^ifflUM 

[0 0 0 8] flfe©t£jiu©5Jf2Si: LT. Smita Gu 
ptaMAllen GershoCiX (Wt "G 
upt aOMX' tBSIB-TS) *'^iii^©iti&*t^ 

(On Fractional Pixel Motion Estimation) " (SP 
IE VCIPm 20 94f40 8-4 1 9M, 
y^ijr^, 1 9 9 3^1 1^) A s ^lf e.tl2>o d©G 
uptaffli^ t!)iliJI«il©fc^^SI5^-ili^**ST'ilj 
*^ b;i/£ttftbil^bT^^bt-&;Si££i6^bT 

vn«>. Guptaoim ^^mmm ( "ms 
e" ) tmmm?£izm'3<4k&.*fflm\s, z<d&^ 

&m*%^x&%.isn, zLoimmmmzmfrw v^tz 

titzo inn ?§asnfc (gp^-sii^) ^s^t-fc 
So m^(DW}^^ hwwattt* 

rtl»©»&©«S»*'<* bMt£ (On the Optimal H 
otion Vector Accuracy for Block-Based Hotion-Compe 
nsated Video Coders) " (IST/SPIE f-fy 
*;i/fi)iIi©ffliS£38 : 7fr1 U XA 3 0 2-3 

1 41, f>yat, 1 9 9 6^2^) mt. Rib 
asOin&fcHStE-f 5) K*5b>t\ Jordi Rib 
as-CorberaMDavid L. Neuh 
offfct. llb*»§E©trv bu— h^©W**5 ff -»Wb 



*>©^^i£g^btc uORibas ©tm&ttv BWM 

8sSIBS£U 35WJB«mi©**%*bfc. ftiSftlbt^ 
£ b;i/i±MSEfcft/b£f 6£h£«fc D«JiU SiHt 
fltttb- bM»©KWbC J: t)#a*nfc»o*»©a55 

[ o o i o ] mx "«t t) *&v> e s> b b- b if ©^aassfb 

£M1~3frbl^7llife©il^ (Proposal for a new c 
ore experiment on Prediction enhancement at higher 
bitrates) " (ISO/IEC JTC1/SC29 
/WG 1 1 ftffi&lF&j^OfSF-f-fbs MP E G 9 7/ 1 8 
2 7,-fet'JT, 1 9 9 7^2^) M"l/4^l'i 
*««©fc#>©*li3 fc«»bfcf^«©ttffilWfli 
(Performance Evaluatior of a Reduced Complexity I 
mplemetatior for Quarter Pel Motion Compensatio 
r) " (ISO/IEC JTC 1/SC2 9/WG 1 
1 WMRVtiPom^ib, MP E G 9 7/3 1 4 6 , V 
>t?3-fe> 1 9 9 8*fi;!) C£^T, Ur i ch B 
enzler its S>Bb>— v->*©fc#>£ 1 /4 
ffimomg^t? b^offiffli^ MPEG4tbifflft-^fba 

moz <o£.M/»izffl37 ■< ;i/^©^ffl*s^bfc. b*> 

b&* s £s Benzlerfc 1 / 4^;i/©ib t b 
;i/©26j|£G i r o d©7 77, W—?-$L&*m^tz° 
Benzlerli, H&3lf IB?-* ;i/**%j*b&*"3 
fc#> £§ 1 Sjrcl^7 £G8ffl bTSg 2 gtfgT' J: 

£^y- b;^ 1 O£0 Vi? D 51 i7 rtCD^T© WJ:* 
- K (fllitBU 16X16, 4-8x8, 16-4x4 

«£) 1 / t^nmxmn^xmm®*- b*** 

SWafcifrfc. Benzlerii 

g^t, Benzlertt, 1 / 2 »V 1 / 4 H*»lK 

( "VLC" ) r— TVl/fcEOTftS"** b;i/SW^<b 
bfc. 

[0 0 11] 

[»W4»»*bi -5 k -TiRlffl] ±3zE©#%35:ilRttx * 
®•^bbfc^-^M»a^PS:fflt^Ta!)&^^ h;U&«l^b 
T4o^-Ts il»J:d&*lp©i2itt«tt**lfflbTttJI©a 

n^sisiitbTVMftv^. $e>t N cm?>©#%^&N 

[0 0 12] 

[HlHft»*"f 4 fe©©^S] #«W© 1 o©*W»» 
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[0 0 13] ^B^JjiM^ffll^iu ffijffliff^fblitt 
ftHSttMfttt&flliE ( "AM A" ) tttLXm&XZ 

a. 

[0014] *»w©»* ^mtmmcD i o& N 

msH Ctf-:/t^-te;i/) ^ffl^*s^S7 7^MJ--^«SBS 
o^U y FrtTv *2-fev b©iti£^ b;HR*lft»JR 

bti®s:i^^ Mi/Vstjaottm-r. *©a N v 3 

Id Hud « 4) o^#>£«> 6 ftfcIE#JB^©«#lffl3MMfc 

[0015] #$6in©«i©#* b^*8B&JBK#v^ 
wows©*)*'** }>)mfe&m&m%%®mz&^xg 

[0016] #$&BJ!©SiJ©$?a: bV^3fcJB&»tt* b 

m# (rd) x-mis-cmmv)^* Y)\stmm*3®R? 

[0017] 6) fcJBU©#* bl^JIIBBStk 

*3»«tRiSE«^9fb (vlc) $sfc«t^ 

feVLCr-^MffiffltSo 
[0 0 18] #»iD!©±aLRV{lb©Bflk watazw^ 

[0 0 19] 

&i8ffl-et£c *%BJl(i N X, ftQl©1Nt£:l3tiTlBtt 
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[^HB9lliD«] 

1 . Title of Invention 

- m METHODS FOR MOTION ESTIMATION WITH 

ADAPTIVE MOTION ACCURACY 

2 . Claims 



1. A fast-search adaptive motion accuracy search method for estimating 
motion vectors in motion-compensated video coding by finding a best motion vector far a 
rnacroblock, said method comprising the steps of: 

(a) searching a first set of motion vector candidates in a grid of sub-pixel 
resolution of a predetermined square radius centered on V, to find a 
best motion vector V 2 ; 

(b) searching a second set of motion vector candidates in a grid of sub- 
pixel resolution of a predetennined square radius centered on V 2 to 
find a best motion vector V 3 ; and 

(c) searching a third set of motion vector candidates in a grid of sub-pixel 
resolution of a predetermined square radius centered on V 3 to find 
said best motion vector of said macroblock. 

2. The method of claim 1, said step of searching a first set of motion 
vector candidates in a grid of sub-pixel resolution of a predetennined square radius centered 
on V, to find a best motion vector V 2 further comprising the step of searching a first set of 
eight motion vector candidates in a grid of 1/2-pixel resolution of square radius 1 centered 
on Vj to find a best motion vector V 2 . 

3. The method of claim 1, said step of searching a second set of motion 
vector candidates in a grid of sub-pixel resolution of a predetermined square radius centered 
on V 2 to find a best motion vector V 3 further comprising the step of searching a second set 
of eight motion vector candidates in a grid of 1/6-pixel resolution of square radius 1 centered 
on V 2 to find a best motion vector V 3 . 
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4. The method of claim 1 further comprising the steps of using V 2 as the 
motion vector for the block if V 2 has the smallest rate-distortion cost and skipping step (c) - . 
of claim 1. 

5. The method of claim 1, said step of searching a third set of motion 
vector candidates in a grid of sub-pixel resolution of a predetermined square radius centered 
on V 3 to find said best motion vector of said macroblock farther comprising the step of 
searching a third set of eight motion vector candidates in a grid of 1/6-pixel resolution of 
Bquare radius 1 centered on V 3 to find said best motion vector of said macroblock. 

6. The method of claim 1, said step of searching a third set of motion 
vector candidates in a grid of sub-pixel resolution of a predeienriined square radius centered 
on V 3 to find said best motion vector of said macroblock further comprising the step of 
skipping motion vector candidates of said third set of motion vector candidates that have 
already been tested. 

7. Hie method of claim 1 further wherein said step of searching said first 
set of motion vector candidates further comprises the step of searching said first set of 
motion vector candidates using a first filter to do a first inteipolation, said step of searching 
said second set of motion vector candidates further comprises the step of searching said 
second set of motion vector candidates using a second filter to do a second interpolation, and 
said step of searching said third set of motion vector candidates further comprises the step 
of searching said third set of motion vector candidates using a third filter to do a third 
interpolation. 

8 . The method of claim 1 , said step of searching a second set of motion 
vector candidates in a grid of sub-pixel resolution of a predetermined square radius centered 
on V 2 to find a best motion vector V 3 further comprising the steps of: 

(a) searching three candidates of 1/3-pel accuracy V 2 and a 1/2-pel 
location with the next lowest RD cost if V 2 is at the center; 

(b) searching four vector candidates of 1/3-pel accuracy that are closest 
to V 2 if V 2 is a comer vector, and 
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(c) determining which of two comers has lower RD cost and searching 
four vector candidates of 1/3-pel accuracy that are closest to a line 
between said corner with lower RD cost, if V 2 is between two corners 
vectors. 

9. An adaptive motion accuracy search method for estimating motion 
vectors in motion-compensated video coding by finding a best motion vector for a 
macroblock, said method comprising the steps of: 

(a) searching a first set of motion vector candidates in a grid centered on 
V x to find a best motion vector V 2 using a first filter to do a first 
interpolation; 

(b) searching a second set of motion vector candidates in a grid centered 
on V a to find a best motion vector V 3 using a second filter to do a 
second interpolation; and 

(c) searching a third set of motion vector candidates in a grid centered on 
V 3 to find said best motion vector of said macroblock using a third 
filter to do a third interpolation. 

1 0. The method of claim 9 wherein said step of searching using, a first 
filter to do a first interpolation further comprises using a simple filter to do a coarse 
interpolation. 

1 1 . The method of claim 9 wherein said step of searching using a first 
filter to do a first interpolation further comprises using a simple filter to do a coarse 
interpolation and said step of searching using a second filter to do a second interpolation 
further comprises using a complex filter to do a fine interpolation. 
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12. The method of claim 1 1 wherein said step of searching using a third 

filter to do a third interpolation further comprises using a complex filter to do a fine * . . 
interpolation. 

13. The method of claim 9 wherein said step of searching using a first 
filter to do a first interpolation further comprises using a bilinear filter to inteipolate the 
reference frame by 2x2. 

14. The method of claim 9 wherein said step- of searching using a first 
filter to do a first' interpolation further, comprises using a bilinear filter to interpolate the 
reference frame by 2x2 and said step. of searching using a second filter to do a second 
interpolation further comprises using a cubic filter to do a fine interpolation, 

15. The method of claim 14 wherein said step of searching using a third 
filter to do a third interpolation further comprises using a cubic filter to do a fine 
interpolation. 

16. An adaptive motion accuracy search method for estimating motion 
vectors in motion-compensated video coding by finding a best motion vector for a 
macroblock, said method comprising the steps of: 

(a) searching at a first motion accuracy for a first best motion vector of 
said macroblock; 

(b) encoding said first best motion vector and said first motion accuracy; 

(c) searching for at least one second best motion vector of said 
macroblock at an at least one second motion accuracy; 

(d) encoding said at least one second best motion vector and said at least 
one second motion accuracy, and 

(e) selecting the best motion vector of said first and at least one best 
motion vectors using rate-distortion criteria. 
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17. The method of claim 16 wherein said step of selecting the best motion 
" vector using rate-distortion criteria further comprises the step of said rate-distortion criteria 

adapting according to the different motion accuracies to determine both the best motion 
vectors and the best* motion accuracies. 

1 8 . The method of claim 1 6, said step of searching for at least one second 
best motion vector at an at least one second motion accuracy further comprising the step of 
searching for atleast one second best motion vector of said macroblock at an at least one 
second motion accuracy that is finer than said first motion accuracy. 

19. The method of claim 16 wherein said step of selecting the best motion 
vector using rate-distortion criteria further comprises the step of using rate-distortion criteria 
of the type "distortion + L*Bits" to select the best motion vector. 

20. An adaptive motion accuracy search method for estimating motion 
vectors in motion-compensated video coding by finding a best motion vector for a 
macroblock, said method comprising the steps of: 

(a) searching at a motion accuracy for a best motion vector of said 
macroblock; 

(b) encoding said motion accuracy using a code from a VLC table that is 
interpreted differently at different coding units according to the 
associated motion vector accuracy; and 

(c) encoding said best motion vector in the respective accuracy space. 

21 A system for estimating motion vectors in motion-compensated video 
coding by finding a best motion vector for a macroblock, said system comprising: 
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(a) a first encoder for searching a first set of motion vector candidates in 
a grid of sub-pixel resolution of a predetermined square radius^ 
centered on V x to find a "best motion vector V 2 ; 

(b) a second encoder for searching a second set of motion vector 
candidates in a grid of sub-pixel resolution of a predetermined square 
radius centered on V 2 to find a best motion vector V 3 ; and 

(c) . a third encoder for searching a third set of motion vector candidates 

in a grid of sub-pixel resolution of a predetermined square radius 
centered on V 3 to find said best motion vector of said macroblock- 

22. The system of claim 21 wherein said first, second, and third encoders 
are a single encoder. 

3 . Detailed Description of Invention 

BACKGROUND OF THE INVENTION 

The present invention relates generally to a method of compressing or 
coding digital video with bits and, specifically, to an effective method for estimating and 
encoding motion vectors in motion-compensated video coding. 

In classical motion estimation the current frame to be encoded is 
decomposed into image blocks of the same size, typically blocks of 16x16 pixels, called 
"macroblocks." For each current macroblock, the encoder searches for the block in a 
previously encoded frame (the "reference frame") that best matches the current 
macroblock. The coordinate shift between a current macroblock and its best match in the 
reference frame is represented by a two-dimensional vector (the "motion vector") of fee 
macroblock. Each component of the motion vector is measured in pixel units. 

For example, if the best match for a current macroblock happens to be at 
the same location, as is the typical case in stationary background, the motion vector for 
the current macroblock is (0,0). . If the best match is found two pixels to the right and 
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three pixels up from the coordinates of the current macrohlock, the motion vector is (2,3). 
Such motion vectors are said to have integer pixel (or "integer-pel" or "full-pel") 
accuracy, since their horizontal X and vertical Y components are integer pixel values. In 
FIG. 1, the vector V, =(1,1) represents the full-pel motion vector for a given current 

macrohlock. , 

Moving objects in a video scene do not move in integer pixel increments 

from frame to frame. True motion can take any real value along the X and Y directions. 
Consequently, a better match for a current macroblock can often be found by 
interpolating the previous frame by a factor NxN and then searching for the best match in 
the inteipolated frame: The motion vectors can then take values in increments of 1/N 
pixel along X and Y and are said to have 1/N pixel (or " 1/N-pel") accuracy. 

In "Response to Call for Proposals for H.26L," ITU-Telecommunications 
Standardization Sector, Q.15/SG16, doc. Q15-F-1 1, Seoul, Nov, 98, and "Enhancement 
of the Telenor proposal forH.26L," ITU-Telecommunications Standardization Sector, 
Q.1S/SG16, doc. Q15-G-25, Monterey, Feb. 99, Gisle Bjontegaard proposed using 1/3- 
pel accurate motion vectors and cubic-like interpolation for the H26L video coding 
standard (the "Telenor encoder"). To do this, the Telenor encoder interpolates or "up- 
samples ,, the reference frame by 3x3 using a cubic-like interpolation filter. This 
interpolated version requires nine times more memory than the reference frame. At a 
given macroblock, the Telenor encoder estimates the best motion vector in two steps: the 
encoder first searches for the best integer-pel vector and then the Telenor encoder 
searches for the best 1/3-pixel accurate vector V M near V,: Using FIG. 1 as an example, 
a total of eight blocks (of 16x16 pixels) in the 3x3 interpolated reference frame are 
checked to find the best match which, as shown is the block associated to the motion 
vector V, 0 = (VX, VY) = (1+1/3,1). The Telenor encoder has several problems. First, it 
uses a sub-optimal fast-search strategy and a complex cubic filter (at all stages) to 
compute the 1/3-pel accurate motion vectors. As a result, the computed motion vectors 
are not optimal and the memory and computation requirements are very expensive. 
Further, the Telenor encoder uses an accuracy of the effective rate-distortion criteria mat 
is fixed at 1/3-pixel and, therefore, does not adapt to select better motion accuracies. 
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Similarly, the Telenor encoder variable-length code ("VLC") table has an accuracy fixed 
at 1/3 -pixel and, therefore, is not adapted and interpreted differently for different 
accuracies. 

Most known video compression methods estimate and encode motion 
vectors with 1/2-pixel accuracy, because early studies suggested that higher or adaptive 
motion accuracies would increase computational complexity without providing additional 
compression gains. These early studies, however, did not estimate the motion vectors 
using optimized rate-distortion criteria, did not exploit the convexity properties of such 
criteria to reduce computational complexity, and did not use effective strategies to encode 
the motion vectors and their accuracies. 

One such early study was Bemd Giiod's "Motion-Compensating 
Prediction with Fractional-Pel Accuracy," IEEE Transactions on Communications, 
Vol. 41, No. 4, pp. 604-612, April 1993 (the "Girod work"). The Girod work is the first 
fundamental analysis on the benefits of using sub-pixel motion accuracy for video 
coding. Girod used a simple, hierarchical strategy to search for the best motion vector in 
sub-pixel space. He also used simple mean absolute difference ("MAD") criteria to select 
the best motion vector for a given accuracy. The best accuracy was selected using a 
formula that is not useful in practice since it is based on idealized assumptions, is very 
complex, and restricts all motion vectors to have the same accuracy within a jSrarne. 
Finally, Girod focused only on prediction error energy and did not address how to use 
bits to encode the motion vectors. 

Another early study was Smita Gupta's and Allen Gersho's "On Fractional 
Pixel Motion Estimation," Proc. SPJE VCIP, VoL 2094, pp. 408-4 1 9, Cambridge, 
November 1993 (the "Gupta work"). The Gupta work presented a method for computing, 
selecting, and encoding motion vectors with sub-pixel accuracy for video compression* 
The Gupta work disclosed a formula based on mean squared error ("MSE") and bilinear 
interpolation, used this formula to find an ideal motion vector, and then quantized such 
vector to the desired motion accuracy. The best motion vector for a given accuracy was 
found using the sub-optimal MSE criteria and the best accuracy was selected using the 
largest decrease in difference energy per distortion bit, which is a greedy (sub-optimal) 
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criteria. A given motion vector was coded by first encoding that vector with 1/2-pel 
accuracy and then encoding the higher accuracy with refinement bits. Course-to-fine 
coding tends to require significant bit overhead. 

In M On the Optimal Motion Vector Accuracy for Block-Based Motion- 
Compensated Video Coders," Proc. IST/SPIE Digital Video Compression: Algorithms 
and Technologies, pp. 302-314, San Jose, February 1996 (the "Ribas work"), Jordi Ribas- 
Corbera and David L. Neuhoff, modeled the effect of motion accuracy on bit rate and 
proposed several methods to estimate the optimal accuracies that minimize bit rate. The 
Ribas work set forth a fiiU-search approach for computing motion vectors for a given 
accuracy and considered only bilinear interpolation. The best motion vector was found 
by minimizing MSE and the best accuracy was selected using some formulas derived 
from a rate-distortion optimization. The motion vectors and accuracies were encoded 
with frame-adaptive entropy coders, which are complex to implement in real-time 
applications. 

La "Proposal for a new core experiment on prediction enhancement at 
higher bitrates," ISO/IEC JTC1/SC29/WG11 Coding of Moving Pictures and Audio, 
MPEG 97/1827, Sevilla, Feb. 1997 and "Performance Evaluation of a Reduced 
Complexity Implementation for Quarter Pel Motion Compensation," ISO/IEC 
JTC1/SC29/WG11 Coding of Moving Pictures and Audio, MPEG 97/3146, San lose, 
Jan. 1 998, Ulrich Benzler proposed using 1/4-pel accurate motion vectors for the video 
sequence and more advanced interpolation filters for the MPEG4 video coding standard. 
Benzler, however, used the Girod's fast-search technique to find the 1/4-pel motion 
vectors. Benzler did consider different interpolation filters, but proposed a complex filter 
at the first stage and a simpler filter at the second stage and interpolated one macroblock 
at a time. This approach does not require much cache memory, but it is computationally 
expensive because of its complexity and because all motion vectors are computed with 
1/4-pel accuracy for all the possible modes in a macroblock (e.g., 1 6x16, four-8x8, 
sixteen-4x4, etc.) and then the best mode is determined. Benzler used the MAD criteria 
to find the best motion vector which was fixed to 1/4-pel accuracy for the whole 
sequence, and hence he did not address how to select the best motion accuracy. Finally, 
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Benzler encoded the motion vectors with a variable-length code ("VLC") table that could 
be used for encoding 1/2 and 1/4 pixel accurate vectors. 

The references discussed above do not estimate the motion vectors using 
optimized rate-distortion criteria and do not exploit the convexity properties of such 
criteria to reduce computational complexity. Further, these references do not use 
effective strategies to encode motion vectors and their accuracies. 

BRIEF SUMMARY OF THE INVENTION 

One preferred embodiment of the present invention addresses the 
problems of the prior art by computing motion vectors of high pixel accuracy (also 
denoted as "fractional" or "sub-pixel" accuracy) with a minor increase in computation. 

Experiments have demonstrated that, by using the search strategy of the 
present invention, a video encoder can achieve significant compression gains (e.g., up to 
thirty percent in bit rale savings over the classical choices of motion accuracy) using 
similar levels of computation. Since the motion accuracies are adaptivcly computed and 
selected, the present invention may be described as adaptive motion accuracy {"AMA"). 

One preferred embodiment of the present invention uses fast-Search 
strategies in sub-pixel space that smartly searches for the best motion vectors. This 
technique estimates motion vectors in motion-compensated video coding by finding a 
best motion vector for a raacroblock. The first step is searching a first set of motion 
vector candidates in a grid of sub-pixel resolution of a predetermined square radius 
centered on V, to find a best motion vector V 2 . Next, a second set of motion vector 
candidates in a grid of sub-pixel resolution of a predetermined square radius centered on 
V 2 is searched to find a best motion vector V 3 . Then, a third set of motion vector 
candidates in a grid of sub-pixel resolution of a predetermined square radius centered on 
V 3 is searched to find the best motion vector of the macroblock, 

Li aia alternate preferred embodiment the present invention a technique for 
estimating high-accurate motion vectors may use different interpolation filters at different 
stages in order to reduce computational complexity. 

Another alternate preferred embodiment of the present invention selects 
the best vectors and accuracies in a rate-distortion ("KD") sense. This embodiment uses 
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rate-distortion criteria that adapts according to the different motion accuracies to 
determine both the best motion vectors and the best motion accuracies. 

Still further, another alternate preferred embodiment of the present 
invention encodes the motion vector and accuracies with an effective VLC approach. 
This technique uses a VLC table that is interpreted differently at different coding units, 
according to the associated motion vector accuracy. 

The foregoing and other objectives, features, and advantages of the 
invention will be more readily understood upon consideration of the following detailed 
description of the invention, taken in conjunction with the accompanying drawings. 

DETAILED DESCRIPTION OF THE INVENTION 

The methods of the present invention are described herein in terms of the 
motion accuracy being modified at each image block. These methods, however, may be 
applied when the accuracy is fixed for the whole sequence or modified on a frame-by-fiame 
basis. The present invention is also described as using Telenor's video encoders (and 
particularly the Telenor encoder) as described in the Background of the Invention. Although 
described in terms of Telenors video encoders, the techniques described herein are 
applicable to any other motion-compensated video coder. 

Most video coders use motion vectors with half pixel (or 4, l/2-pel M ) accuracy 
and bilinear interpolation. The first version of Telenor's encoder also used 1/2-pel motion 
vectors and bilinear interpolation. The latest version of Telenor's encoder, however, 
incorporated 1/3 -pel vectors and cubic-like interpolation because of the additional 
compression gains. Specifically, at a given macroblock, Telenor's estimates the best motion 
vector in two steps shown in FIG. 2. First, the Telenor encoder searches for the best integer- 
pel vector V! (FIG. 1) 100. Second, the Telenor encoder searches for the best 1/3-pixel 
accurate vector V w (FIG. 1) near V, 102. This second step is shown graphically in FIG. 1 
where a total of eight blocks (each having an array of 16x1 6 pixels) in the 3x3 intcipolated 
reference frame are checked to find the best match. The motion vectors for these eight 
blocks are represented by the eight solid dots in the grid centered on V,. In FIG. 1 the best 
match is the block associated to the motion vector V l/3 = (V x , V Y ) = (1+1/3, 1). 

The technology of the present invention allows the encoder to choose 
between any set of motion accuracies (for example, 1/2, 1/3, and 1/6-pel accurate motion 
vectors) using either a fall search strategy or a fast search strategy. 
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Full-Search AMA Search Strategy 

As shown in FIGS . 3 and 4, in the full-search adaptive motion accuracy • 
("AMA") search strategy the encoder searches all the motion vector candidates in a grid 
of 1/6-pixel resolution and a "square radius" (defined herein as a square block defined by 
a number of pixels up, a number of pixels down, and a number of pixels to both sides) of 
five pixels as shown in FIG. 3 . FIG. 4 shows that the first step of the full-search AMA is 
to search for the best integer-pel vector V, (FIG. 1) 104. In the second step of the full- 
search AMA, the encoder searches for the best 1/6-pixel accurate vector V, /4 (FIG. 3) near 
V, 106. In other words, the full-search AMA modifies Ihe second step of the Telenor's 
process so that the encoder also searches for motion vector candidates in olher sub-pixel 
locations in the velocity space. The objective is to find the best motion vector in the grid, 
i.e., the vector that points to the block (in the interpolated reference frame) lhal best 
matches the current macroblock. Although the full-search strategy is computationally 
complex since it searches 120 sub-pixel candidates, it shows the full potential of this 
preferred method of the present invention. 

A critical issue in the motion vector search is the choice of a measure or 
criterion for establishing which block is the best match for the given macroblock. In 
practice, most methods use either the mean squared error ("MSB") or mean absolute 
difference ('MAD") criteria. The MSE between two blocks consists of subtracting the 
pixel values of the two blocks, squaring the pixel differences, and then taking the 
average. The MAD difference between two blocks is a similar distortion measure, except 
that the absolute value of the pixel differences is computed instead of the squares. If two 
image blocks are similar to each other, the MSE and MAD values will be small. If, 
however, the image blocks are dissimilar, these values will be large. Hence, typical video 
coders find me best match for a macroblock by selecting the motion vector that produces 
either the smallest MSE or the smallest MAD. In other words, the block associated to the 
best motion vector is the one closest to the given macroblock in an MSE or MAD sense. 

Unfortunately, the MSE and MAD distortion measures do not take into 
account the cost in bits of actually encoding the vector. For example, a given motion 
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vector may minimize the MSE, but it may be very costly to encode with bits, so it may 
not be the best choice from an coding standpoint. 

To deal with this, advanced encoders such as those described by Telenor 
use rate-distortion ("RD") criteria of the type "distortion + L*Bits" to select the best 
motion vector. The value of "distortion" is typically the MSE or MAD, "L" is a constant 
that depends on the compression level (i.e., the quantization step size), and "Bits" is the 
number of bits required to code the motion vector. In general, any RD criteria of this 
type would work with the present invention. However, in the present invention "Bits" 
include the bits needed for encoding the vector and those for encoding the accuracy of the 
vector. In fact, some candidates can have several "Bits" values, because they can have 
several accuracy modes. For example, the candidate at location (1/2, -1/2) can be thought 
of having 1/2 or 1/6 pixel accuracy. 

Fast-Search AMA Search Strategy 

As shown in FIGS. 5 and 6, in the fast-search adaptive motion accuracy 
("AMA") search strategy the encoder checks only a small set of the motion vector 
candidates. In the first step of the fast-search AMA, the encoder checks the eight motion 
vector candidates in a grid of 1/2-pixel resolution of square radius 1, which is centered on 
V x 108. V 2 is then set to denote the candidate that has the smallest RD cost (i.e., the best of 
the eight previous vectors and V,) 1 10. Next, the encoder checks the eight motion vector 
locations in a grid of 1/6-pixel resolution of square radius 1 that is now centered on V a 1 12. 
If V* has the smallest RD cost 1 14, the encoder stops its search and selects V 2 as the motion 
vector for the block. Otherwise, V 3 is set to denote the best motion vector of the eight 116. 
The encoder then searches for a new motion vector candidates in the grid of 1/6-pixel 
resolution of square radius 1 that is centered on V 3 1 1 8. It should be noted that some of the 
candidates in this grid have already been tested and can be skipped. The candidate with the 
smallest RD cost in this last step is selected as the motion vector for the block 120. 

Experimental data has shown that, on average, this simple fast search strategy 
typically checks the RD cost of about eighteen locations in sub-pixel space (ten more than 
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Telenor's search strategy), and hence the overall computational complexity is only 
moderately increased. •« 

The experimental data discussed below in connection with FIGS. 8-18 show 
that there is practically no loss in compression performance from using this fast-search 
version of AMA This is because the fast-search AMA search strategy exploits the convexity 
of the "distortion +- L*Bits" curve (c.f., "distortion" is known to be convex), by creating a 
path that smartly follows theKD cost from higher to lower levels. 

Alternate embodiments of the invention replace one or more of the steps 108- 
120. These embodiments have also been effective and have further reduced the number of 
motion vector candidates to check in the sub-pixel velocity space. 

FIG. 7, for example, checks candidates of 1/3-pel accuracy. In this 
embodiment step 1 12 is replaced by one of three possible scenarios. First, if the best motion 
vector candidate from step 1 1 0 is at the center of V, (the "integer-pel vector") 130, then the 
encoder checks three candidates of 1/3-pel accuracy between the center vector and the 1/2- 
pel location with the next lowest RD cost 1 32. Second, if the best motion vector candidate 
from step 110 is a comer vector 134, then, the encoder checks the four vector candidates of 
1/3-pel accuracy that are closest to such comer 136. Third, if the best motion vector 
candidate from step 110 is between two comers 138, then, the encoder determines which of 
these two comers has lower RD cost and checks the four vector candidates of 1/3-pel 
accuracy that are closest to the line between such comer and the best candidate from step 1 10 
140. It should be noted that in implementing this process step 138 may be unnecessary 
because if V2 is neither at the center or a comer vector, then it would necessarily be between 
two comers. If the encoder is set to find motion vectors with 1/3-pixcl accuracy, FIG. 7 
could be modified to end rather than continuing with step 114. 

Computation And Memory Savings 

Because step 108 checks only motion vector candidates of 1/2-pixel 
accuracy, the computation and memory requirements for tlie hardware or software 
implementation are significantly reduced. To be specific, in a smart implementation 
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embodiment of this fast-search the reference frame is interpolated by 2x2 in order to obtain 
'the RD costs for the 1/2-pel vector candidates. A significant amount of fast (or cache) 
memory for a hardware or software encoder is saved as compared to Telenor's approach that 
needed to interpolate the reference frame by 3x3 . In comparison to the Telenor encoder, this 
is a cache memory savings of 9/4 or a factor of 2.25. The few additional interpolations can 
be done later on a block-by-block basis. 

Additionally, since the interpolations in step 108 are used to direct the search 
towards the lower values of the RD cost function, a complex filter is not needed for these 
interpolations. - Accordingly, computation power may be saved by using a simple bilinear 
filter for step 108. 

Also, other key coding decisions such as selecting the mode of a macroblock 
(e.g., 1 6x1 6, four-8x8, etc.) can be done using the 1/2-pel vectors because such decisions do 
not benefit significantly from using higher accuracies. Then, the encoder can use a more 
complex cubic filter to interpolate the required sub-pixel values for the few additional vector 
candidates to check in the remaining steps. Since the macroblock mode has already been 
chosen, these final interpolations only need to be done for the chosen mode. 

Use of multiple-filters obtained computation savings of over twenty 
percent in running time on a Sparc Ultra 10 Workstation in comparison to Telenor's 
approach, which uses a cubic interpolation all the time. Additionally, the fast-memory 
requirements were reducedby nearly half. Also/there was little or rio loss in 
compression performance. Comparing oneprefeired embodiment of the fast-search, 
Benzlcr's technique requires about 70 interpolations per pixel in the Telenor encoder and 
the present invention requires only about 7 interpolations per pixel. 

Coding The Motion Vector And Accuracies With Bits 

Once the best motion vector and accuracy are deteimincd, the encoder 
encodes both the motion vector and accuracy values with bits. One approach is to encode 
the motion vector with a given accuracy (e.g., half-pixel accuracy) and then add some extra 
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bits fox refining the vector to the higher motion accuracy. This is the strategy suggested by 
B. Girod, but it is sub-optimal in a rate-distortion sense. 

In one preferred embodiment of the present invention, the accuracy of the 
motion vector for a macroblock is first encoded using a simple code such as the one given 
in Table 1 . Any other table with code lengths { 1, 2, 2} could be used as well. The bit rate 
could be further reduced using a typical DPCM approach. 



Code 


Motion. 
Accuracy 


I 


1/2-pel 


. 01 


1/3-pel 


11 


1/6-pel 



Table 1 . VLC table to indicate the accuracy 
mode for a given macroblock. 
Next, the value of the vector/s in the respective accuracy space is encoded. These bits can 
be obtained from entries of a single VLC table such as the one used in the H26L codec . The 
key idea is that these bits are interpreted differently depending on the motion accuracy for 
the macroblock. For example, if the motion accuracy is 1/3 and the code bits for the X 
component of the difference motion vector are 000011, the X component of the vector is 
Vx= 2/3. If the accuracy is 1/2, such code corresponds to Vx= 1 . 

Compared to the Benzler method for encoding the motion vectors with a 
variable length code ("VLC") table that could be used for encoding 1/2 and 1/4 pixel 
accurate vectors, the method of the present invention can be used for encoding vectors of 
any motion accuracy and the table can be interpreted differently at each frame and 
macrobloclc Further, the general method of the present invention can be used for any 
motion accuracy, not necessarily those that are multiples of each other or those that are of 
the type t/n (with n an integer). The number of increments in the given sub-pixel space is 
simply counted and the bits in the associated entry of the table is used as the code. 
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From the decoder's viewpoint, once the motion accuracy is decoded, the 
motion vector can also be easily decoded. After that, the associated block in the previous 
frame is reconstructed using a typical 4-tap cubic interpolator. There is a different 4-tap 
filter for each motion accuracy. 

The AMA does not increase decoding complexity, because the number of 
operations needed to reconstruct the predicted block are the same, regardless of the 
motion accuracy. 

Experimental Results 

' TIGS. 8-18 show test results of the Telenor encoder codec with and without 
AMA in a variety of video sequences, resolutions, and frame rates, as described in Table 2. 
These figures show rate-distortion ("RD") plots for each case. The "Anchor" curve shows 
RD points from optimized H.263+ (FIGS. 8 and 9 only). The 'Telenor V2+V curve shows 
Telenor with 1/2-pel vectors and bilinear interpolation (the "classical case"). The "Telenor 
1/3" curve shows the current Telenor proposal (the "Telenor encoder"). The 
"Telenor+AMA+c" curve shows the Telenor encoder with the full-search strategy of the 
present invention. The " Telenor h-FSAMA+c", as shown in FIGS. 15-17, shows the current 
Telenor encoder with the fast-search strategy. (Unless otherwise specified, the full-search 
version of AMA was the encoder strategy used in the experiments.) All of the test results 
were cross-checked at the encoder and decoder. These results show that with AMA the gains 
in peak signal-to-noise ratio ("PSNR") can be as high as 1 dB over H26L, and even higher 
over the classical case. 



Video sequence 


FIG.# 


Resolution 


Frame rate 


Container 


FIG. 8 


QCIF 
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News 


FIG. 9 


QCIF 
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Mobile 


FIG. 10 


QCIF 
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FIG. 11 




1 5 


Garden 


FIG. 12 


QCEF 


15 


Tempete 


FIG. 13 


SIF 


15 




FIG. 14 


QCIF 


15 


Paris Snaked 


FIG. 15 


QCIF 


10 



The video sequences are commonly used by the video coding community, 
except for "Paris Shaked." The latter is a synthetic sequence obtained by shifting the well- 
known sequence 'Tans" by a motion vector whose X and Y components take a random value 
within [-1,1]. This synthetic sequence simulates small movements earned by a hand-held 
camera in a typical video phone scene. 

Comparison Of Full-Search And Fast-Search AMA 

The experimental results shown in FIGS. 16 and 17 demonstrate that the 
encoder performance with fast-search ("Telenor FSAMA+c") and full-search ("Telenor 
AMA+c") strategies for AMA is practically the same. This is true because the fast-search 
strategies exploit the convexity of the KD cost cxirve in the sub-pixel velocity space. In other 
words, since the shape of the RD cost follows a smooth convex curve, its minimum should 
be easy to find with some smart fast-search schemes that descend down the curve. 



Combining AMA And Multiple Reference Frames 

In the plot shown in FIG. 18, the curves labeled "lr^ used only one reference 
frame for the motion compensation, so these curves are the same as those presented in FIG. 
1 0. The curves labeled "5T used five reference frames. 

The experiments show that the gains with AMA add to those obtained using 
multiple reference frames. The gain from AMA in the onc-rcfcrence case can be measured 
by comparing the green and pink curves, and the gain in the five-reference case can be 
measured between the blue and red curves. 
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It should be noted that the present invention may be implemented at the 
frame level so that different frames could use different motion accuracies, but within a frame 
- all motion vectors would use the same accuracy. Preferably in this embodiment the motion 
vector accuracy would then be signaled only once at the frame layer. Experiments have 
shown that using the best, fixed motion accuracy for the whole frame should also produce 
compression gains as those presented here for the macroblock-adaptive case. 

In another frame-based embodiment the encoder could do motion 
compensation on the entire frame with the different vector accuracies and then select the best 
accuracy according to the ED criteria. This approach is not suitable for pipeline, one-pass 
encoders, but it could be appropriate for software-based or more complex encoders. Still 
another fame-based embodiment the encoder could use previous statistics and/or formulas 
to predict what will be the best accuracy for a given frame (e.g., the formulas in set forth in 
the Ribas work or a variation thereof can be used). Ibis approach would be well-suited for 
one-pass encoders, although the performance gains would depend on the precision of the 
formulas used for the prediction. 

The terms and expressions which have been employed in the foregoing 
specification are used therein as terms of description and not of limitation, and there is no 
intention, in the use of such terms and expressions, of excluding equivalents of the 
features shown and described or portions thereof, it being recognized that the Bcope of the 
invention is defined and limited only by the claims that follow. 

Brief Description of Drawings 

FIG. 1 is a diagram of an exemplary full-pel and 1/3-pel locations in 

velocity space. 

FIG. 2 is a flowchart illustrating a prior art method for estimating the best 

motion vector. 

FIG. 3 is a diagram of an exemplary location of motion vector candidates 
for full-search in sub-pixel velocity space. 
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FIG. 4 is a flowchart illustrating a full-search preferred embodiment of the 
method for estimating the best motion vector of the present invention, 

FIG, 5 is a diagram of an exemplary location of motion vector candidates 
for fast-search in sub-pixel velocity space, 

FIG. 6 is a flowchart illustrating a fest-search preferred embodiment of the 
method for estimating the best motion vector of the present invention. 

FIG. 7 is a detail flowchart illustrating an alternate preferred embodiment 

of step 114 of FIG. 6. 

FIG. 8 is a graphical representation of experimental performance results of 
the Telenor encoder with and without AMA in the "Container" video sequence, with 
QCIF resolution, and at the frame rate of 10 frames per second. 

FIG. 9 is a graphical representation of experimental performance results of 
the Telenor encoder with and without AMA in the "News" video sequence, with QCIF 
resolution, and at the frame rate of 10 frames per second. 

FIG. 10 is a graphical representation of experimental performance results 
of the Telenor encoder with and without AMA in the "Mobile" video sequence, with 
QCIF resolution, and at the frame rate of 10 frames per second. 

FIG. 1 1 is a graphical representation of experimental performance results 
of the Telenor encoder with and without AMA in the "Garden" video sequence, with SIF 
resolution, and at the frame rale of 15 frames per second. 

FIG. 12 is a graphical representation of experimental performance results 
of the Telenor encoder with and without AMA in the "Garden" video sequence, with 
QCIF resolution, and at the frame rate of 15 frames per second. 

FIG. 13 is a graphical representation of experimental performance results 
of the Telenor encoder with and without AMA in the "Tempete" video sequence, with 
SIF resolution, and at the frame rate of 15 frames per second. 

FIG. 14 is a graphical representation of experimental performance results 
of the Telenor encoder with and without AMA in the "Tempete" video sequence, with 
QCIF resolution, and at the frame rate of 15 frames per second. 

FIG. 15 is a graphical representation of experimental performance results 
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of the Telenor encoder with and without AMA in the "Paris shaked" video sequence, with 
QCIF resolution, and at the frame rate of 1 0 frames per second. . 

FIG. 16 is a graphical representation of experimental performance results 
of fast-search ("Telenor FSAMA+c") and full-search ("Telenor AMA+c") strategies in 
the "Mobile" video sequence, with QCEF resolution, and at the frame rate of 10 frames 
per second. 

FIG, 17 is a graphical representation of experimental performance results 
of fast-search ("Telenor FSAMA+c") and foil-search ("Telenor AMA+c") strategies in 
the "Container" video sequence, with QCIF resolution, and at the frame rate of 10 frames 
per second. 

FIG. 18 is a graphical representation of experimental performance results 
of tests using only one reference frame for motion compensation as compared to tests 
using multiple reference frames for motion compensation the in the "Mobile" video 
sequence, with QCIF resolution, and at the frame rate of 10 frames per second. 
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The encoder searches 
for the best integer-pel vector V1 . 



hi • 

100 



The encoder searches for 
the best 1/3-pixel accurate 
vector V1 /3 near V1 . 
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FIG.3 



o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


• 


o 


• 


o 


• 


o 


• 


o 


• 


o 


o 


o 


ffi 


o 


o 




o 


o 




o 


o 


o 


• 


o 


• 


o 


• 


o 


• 


o 


• 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


• 


ID 


• 


o 


@ 


o 


• 


© 


• 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


• 


o 


• 


o 


• 


o 


• 


o 


• 


o 


o 


o 


o 


o 


o 


ft 


o 


o 




o 


o 


o 


• 


o 


• 


o 


• 


o 


• 




• 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 



© 


1-pel 




1/2-pel 


• 


1/3-pel 


o 


1/6-pel 



1/6 pixel 



(38) 



4WF13 



FIG.4 



The encoder searches 
for the best integer-pel vector V1 . 
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The encoder searches for 
the best 1/6-pixel accurate 
vector V1 /6 near V1 . 
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FIG.5 
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FIG.6 



Checking the eight motion vector candidates in a grid of 
1/2-pixel resolution of radius 1 which is centered on V1. 
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Denoting the candidate that has the smallest RD cost as V2. ^ 
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Checking the eight motion vector locations.in a grid of 
1/6-pixel resolution of radius 1 that is now centered on V2. 



Yes 
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Denoting the best motion vector of the eight as V3. >^ 
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Checking the new motion vector candidates in 
the grid of 1/6-pixel resolution of radius 1 that Is 
centered on V3. Observe that some of the candidates in 
the grid have already been tested and can be skipped. 
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Selecting the candidate with the smallest 
RD cost as the motion vector for the block. 
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FIG.7 
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Checking three candidates 
of 1/3-peI accuracy between the 
center vector and the 1/2-pel 
location with the next lowest RD cost 
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Checking the four vector 
candidates of 1/3-pel accuracy 
that are closest to such corner. 
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Yes 



Determining which of the two corners 

has lower RD cost and checking 
the four vector candidates of 1/3-pel 
accuracy that are closest to the line 
between such corner and V2. 



140 




114 



(41) WKW13-189934 

FIG.8 
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FIG.10 
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FIG.12 



R-D Perioimancc : Garden Qcii TKps 




(46) »W¥13-189934 

FIG.13 
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FIG.16 
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FIG.17 
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ethods for motion estimation with adaptive motion accuracy of the 
include several techniques for computing motion vectors of high pixel 
accuracy with a minor increase in computation. One technique uses fast-search strategies 
in sub-pixel space that smartly searches for the best motion vectors. An alternate 
technique estimates high-accurate motion vectors using different interpolation filters at 
different stages in order to reduce computational complexity. Yet another technique uses 
rate-distortion criteria that adapts according to the different motion accuracies to 
determine both the best motion vectors and the best motion accuracies. Still another 
technique uses a VLC table that is interpreted differently at different coding units, 
according to the associated motion vector accuracy. 

2. Representative Drawing 
Fig. 6 
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